Resource Bricolage for Parallel Database Systems
نویسندگان
چکیده
Running parallel database systems in an environment with heterogeneous resources has become increasingly common, due to cluster evolution and increasing interest in moving applications into public clouds. For database systems running in a heterogeneous cluster, the default uniform data partitioning strategy may overload some of the slow machines while at the same time it may under-utilize the more powerful machines. Since the processing time of a parallel query is determined by the slowest machine, such an allocation strategy may result in a significant query performance degradation. We take a first step to address this problem by introducing a technique we call resource bricolage that improves database performance in heterogeneous environments. Our approach quantifies the performance differences among machines with various resources as they process workloads with diverse resource requirements. We formalize the problem of minimizing workload execution time and view it as an optimization problem, and then we employ linear programming to obtain a recommended data partitioning scheme. We verify the effectiveness of our technique with an extensive experimental study on a commercial database system.
منابع مشابه
Heuristic Optimization of Speedup and Benefit/Cost for Parallel Database Scans on Shared-Memory Multiprocessors
Previous work on parallel database systems has paid little attention to the interaction of asynchronous disk prefetching and processor parallelism. This paper investigates this issue for scan operations on shared–memory multiprocessors. Two heuristic methods are developed for the allocation of processors and memory to optimize either the speedup or the benefit/cost ratio of database scan operat...
متن کاملLearning to Grow: Dynamic Capabilities in New Technology-based Firms
In this conceptual paper we consider resource acquisition and configuration in new technologybased firms. Essentially, we propose that bootstrapping and bricolage are essential dynamic capabilities in such firms. We argue these embedded routines, which are connected by learning processes, provide a way to remain lean and agile while managing innovation in a dynamic environment. Bootstrapping en...
متن کاملAnalytical response time estimation in parallel relational database systems
An analytical approach to response time estimation in parallel relational database systems has been developed. It is based on a representation of database activity, in which queries are mapped to low-level patterns of resource consumption, capturing the execution logic of relational operators and mechanisms such as pipelined and partitioned execution. Resource usage profiles are mapped to open ...
متن کاملMaking Mhealth Happen for Health Information Systems in Low Resource Contexts
The paper offers a reference typology for large scale mHealth solutions in lowresource contexts. The proposed typology is produced through action research engagement with various mHealth initiatives within primary health care; including one fully deployed large-scale solution, medium-sized pilot studies and projects currently being implemented. Our investigations are informed by theoretical ass...
متن کاملEfficient Resource Utilization in Shared-Everything Environments
E cient resource usage is a key to achieve better performance in parallel database systems. Up to now, most research has focussed on balancing the load on several resources of the same type, i.e. balancing either CPU load or I/O load. In this paper, we present oating probe, a strategy for parallel evaluation of pipelining segments in a shared-everything environment that provides dynamic load ba...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- PVLDB
دوره 8 شماره
صفحات -
تاریخ انتشار 2014